Animate Vision in a Rich Environment
Abstract
Most research in computer vision has been directed towards minimalistic approaches, which address how properties of the environment can be computed from as little information as possible. Although such approaches may be scientifically well motivated, they have resulted in only limited progress towards our understanding of seeing systems. Ballard, Bajcsy and others have pointed out the importance of vision being an active process that is tightly connected to behaviors. We support this view and also propose that exploiting the fact that the world is rich in information is essential. We develop this idea to show how attention and figure-ground segmentation by an active observer using multiple cues can be separated from analyzing and recognizing what is seen in a consistent way. Continuous operation over time and early use of three-dimensional cues are important in this context. We illustrate our proposed approach with some experiments on a real-time active system.
1 Introduction
Vision is a sense by which seeing creatures acquire information about a dynamically changing environment and thereby guide many of their behaviors and actions. Computer vision research aims at understanding and developing computer-based systems with such capabilities. Despite extensive efforts for more than three decades, we still seem to be very far from such a goal. Although there exists ample knowledge of how information about the environment can be computed from visual cues, we see little progress towards what can be called seeing systems. […] has pointed out that the major reason for this is that vision, as we know it from biology, is an active process and that traditional computer vision approaches take no heed of this fact. Ballard [1989, 1991] analyses this discrepancy and its consequences. Here we will further Ballard's arguments and discuss some additional issues which we believe are crucial.
We will also report on recent progress towards the realization of animate vision systems. Emphasizing the strong ties between vision and behaviors, Ballard particularly considers the need for gaze control and what he terms "quickly computable features". The point that we want to stress in this context is that the real world is rich in information and that a multitude of such features can be computed when needed. We will argue that this suggests a paradigm of attentional mechanisms coupled to possibly independent mechanisms for deriving scene characteristics and information about objects. Notably, this implies that the environment itself influences what should …
Similar resources
Twelve Issues for Cognitive Science
I am struck by how little is known about so much of cognition. One goal of this paper is to argue for the need to consider a rich set of interlocking issues in the study of cognition. Mainstream work in cognition, including my own, ignores many critical aspects of animate cognitive systems. Perhaps one reason that existing theories say so little relevant to real world activities is the neglect of...
Dissolve Detection in a video sequence based on Animate Vision
The objective of video segmentation is to segment a video sequence into parts called shots, each corresponding to a continuous set of frames taken from one camera. Transitions between shots can be abrupt (cuts) or gradual. Abrupt transitions can be easily detected, while detection of gradual transitions, such as dissolves, is still an unsolved problem. In this paper we present a novel approach for a reli...
Operating System Support for Animate Vision
Animate vision systems couple computer vision and robotics to achieve robust and accurate vision, as well as other complex behavior. These systems combine low-level sensory processing and effector output with high-level cognitive planning, all computationally intensive tasks that can benefit from parallel processing. A typical animate vision application will likely consist of many tasks, each of...
Robot Motion Vision Part I: Theory
A direct method called fixation is introduced for solving the general motion vision problem: arbitrary motion relative to an arbitrary environment. This method results in a linear constraint equation which explicitly expresses the rotational velocity in terms of the translational velocity. The combination of this constraint equation with the Brightness-Change Constraint Equation solves the gene...
Reference Frames for Animate Vision
Animate vision systems have gaze control mechanisms that can actively position the camera coordinate system in response to physical stimuli. Compared to passive systems, animate systems show that visual computation can be vastly less expensive when considered in the larger context of behavior. We are accustomed to thinking of the task of vision as being the construction of a detailed representat...